<210> 1

<211> 1209 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (33)..(854)

<220> 

<221> sig_peptide 
<222> (33)..(89)

<220> 

<221> misc_feature
<222> (693)..(755)

<223> predicted transmembrane domain 
<400> 1 

cagaaagaga cctatatgat caaatacaga ac atg atc ttc ctc ctg cta atg 53

Met Ile Phe Leu Leu Leu Met
1 5

ttg agc ctg gaa ttg cag ctt cac cag ata gca gct tta ttc aca gtg 101
Leu Ser Leu Glu Leu Gln Leu His Gln Ile Ala Ala Leu Phe Thr Val
10 15 20

aca gtc cct aag gaa ctg tac ata ata gag cat ggc agc aat gtg acc 149
Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His Gly Ser Asn Val Thr
25 30 35

ctg gaa tgc aac ttt gac act gga agt cat gtg aac ctt gga gca ata 197
Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val Asn Leu Gly Ala Ile
40 45 50 55

aca gcc agt ttg caa aag gtg gaa aat gat aca tcc cca cac cgt gaa 245
Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr Ser Pro His Arg Glu
60 65 70

aga gcc act ttg ctg gag gag cag ctg ccc cta ggg aag gcc tcg ttc 293
Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu Gly Lys Ala Ser Phe
75 80 85

cac ata cct caa gtc caa gtg agg gac gaa gga cag tac caa tgc ata 341
His Ile Pro Gln Val Gln Val Arg Asp Glu Gly Gln Tyr Gln Cys Ile
90 95 100

atc atc tat ggg gtc gcc tgg gac tac aag tac ctg act ctg aaa gtc 389
Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr Leu Thr Leu Lys Val
105 110 115

aaa gct tcc tac agg aaa ata aac act cac atc cta aag gtt cca gaa 437
Lys Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu Lys Val Pro Glu
120 125 130 135

aca gat gag gta gag ctc acc tgc cag gct aca ggt tat cct ctg gca 485
Thr Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly Tyr Pro Leu Ala
140 145 150

gaa gta tcc tgg cca aac gtc agc gtt cct gcc aac acc agc cac tcc 533
Glu Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn Thr Ser His Ser
155 160 165

agg acc cct gaa ggc ctc tac cag gtc acc agt gtt ctg cgc cta aag 581
Arg Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val Leu Arg Leu Lys
170 175 180

cca ccc cct ggc aga aac ttc agc tgt gtg ttc tgg aat act cac gtg 629
Pro Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp Asn Thr His Val
185 190 195

agg gaa ctt act ttg gcc agc att gac ctt caa agt cag atg gaa ccc 677
Arg Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser Gln Met Glu Pro
200 205 210 215

agg acc cat cca act tgg ctg ctt cac att ttc atc ccc tcc tgc atc 725
Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile Pro Ser Cys Ile
220 225 230

att gct ttc att ttc ata gcc aca gtg ata gcc cta aga aaa caa ctc 773
Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu Arg Lys Gln Leu
235 240 245

tgt caa aag ctg tat tct tca aaa gac aca aca aaa aga cct gtc acc 821
Cys Gln Lys Leu Tyr Ser Ser Lys Asp Thr Thr Lys Arg Pro Val Thr
250 255 260

aca aca aag agg gaa gtg aac agt gct atc tga acctgtggtc ttgggagcca 874
Thr Thr Lys Arg Glu Val Asn Ser Ala Ile
265 270

gggtgacctg atatgacatc taaagaagct tctggactct gaacaagaat tcggtggcct 934
gcagagcttg ccatttgcac ttttcaaatg cctttggatg acccagcact ttaatctgaa 994 
acctgcaaca agactagcca acacctggcc atgaaacttg ccccttcact gatctggact 1054 
cacctctgga gcctatggct ttaagcaagc actactgcac tttacagaat taccccactg 1114 
gatcctggac ccacagaatt ccttcaggat ccttcttgct gccagactga aagcaaaagg 1174 
aattatttcc cctcaagttt tctaagtgat ttcca 1209



<210> 2 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ile Phe Leu Leu Leu Met Leu Ser Leu Glu Leu Gln Leu His Gln
1 5 10 15

Ile Ala Ala Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile
20 25 30

Glu His Gly Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser
35 40 45

His Val Asn Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn
50 55 60

Asp Thr Ser Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu
65 70 75 80

Pro Leu Gly Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp
85 90 95

Glu Gly Gln Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr
100 105 110

Lys Tyr Leu Thr Leu Lys Val Lys Ala Ser Tyr Arg Lys Ile Asn Thr
115 120 125

His Ile Leu Lys Val Pro Glu Thr Asp Glu Val Glu Leu Thr Cys Gln
130 135 140

Ala Thr Gly Tyr Pro Leu Ala Glu Val Ser Trp Pro Asn Val Ser Val
145 150 155 160

Pro Ala Asn Thr Ser His Ser Arg Thr Pro Glu Gly Leu Tyr Gln Val
165 170 175

Thr Ser Val Leu Arg Leu Lys Pro Pro Pro Gly Arg Asn Phe Ser Cys
180 185 190

Val Phe Trp Asn Thr His Val Arg Glu Leu Thr Leu Ala Ser Ile Asp
195 200 205

Leu Gln Ser Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His
210 215 220

Ile Phe Ile Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val
225 230 235 240

Ile Ala Leu Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys Asp
245 250 255

Thr Thr Lys Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser Ala
260 265 270

Ile



<210> 3 
<211> 254 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> TRANSMEM 
<222> (202)..(222)

<400> 3 

Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His Gly
1 5 10 15

Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val Asn
20 25 30

Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr Ser
35 40 45

Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu Gly
50 55 60

Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp Glu Gly Gln
65 70 75 80

Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr Leu
85 90 95

Thr Leu Lys Val Lys Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu
100 105 110

Lys Val Pro Glu Thr Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly
115 120 125

Tyr Pro Leu Ala Glu Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn
130 135 140

Thr Ser His Ser Arg Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val
145 150 155 160

Leu Arg Leu Lys Pro Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp
165 170 175

Asn Thr His Val Arg Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser
180 185 190

Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile
195 200 205

Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu
210 215 220

Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys Asp Thr Thr Lys
225 230 235 240

Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser Ala Ile
245 250



<210> 4 
<211> 224 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Gly His Thr Arg Arg Gln Gly Thr Ser Pro Ser Lys Cys Pro Tyr
1 5 10 15

Leu Asn Phe Phe Gln Leu Leu Val Leu Ala Gly Leu Ser His Phe Cys
20 25 30

Ser Gly Val Ile His Val Thr Lys Glu Val Lys Glu Val Ala Thr Leu
35 40 45

Ser Cys Gly His Asn Val Ser Val Glu Glu Leu Ala Gln Thr Arg Ile
50 55 60

Tyr Trp Gln Lys Glu Lys Lys Met Val Leu Thr Met Met Ser Gly Asp
65 70 75 80

Met Asn Ile Trp Pro Glu Tyr Lys Asn Arg Thr Ile Phe Asp Ile Thr
85 90 95

Asn Asn Leu Ser Ile Val Ile Leu Ala Leu Arg Pro Ser Asp Glu Gly
100 105 110

Thr Tyr Glu Cys Val Val Leu Lys Tyr Glu Lys Asp Ala Phe Lys Arg
115 120 125

Glu His Leu Ala Glu Val Thr Leu Ser Val Lys Ala Asp Phe Pro Thr
130 135 140

Pro Ser Ile Ser Asp Phe Glu Ile Pro Thr Ser Asn Ile Arg Arg Ile
145 150 155 160

Ile Cys Ser Thr Ser Gly Gly Phe Pro Glu Pro His Leu Ser Trp Leu
165 170 175

Glu Asn Gly Glu Glu Leu Asn Ala Ile Asn Thr Thr Val Ser Gln Asp
180 185 190

Pro Glu Thr Glu Leu Tyr Ala Val Ser Ser Lys Leu Asp Phe Asn Met
195 200 205

Thr Thr Asn His Ser Phe Met Cys Leu Ile Lys Tyr Gly His Leu Arg
210 215 220 



<210> 5 
<211> 323 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Gly Leu Ser Asn Ile Leu Phe Val Met Ala Phe Leu Leu Ser Gly
1 5 10 15

Ala Ala Pro Leu Lys Ile Gln Ala Tyr Phe Asn Glu Thr Ala Asp Leu
20 25 30

Pro Cys Gln Phe Ala Asn Ser Gln Asn Gln Ser Leu Ser Glu Leu Val
35 40 45

Val Phe Trp Gln Asp Gln Glu Asn Leu Val Leu Asn Glu Val Tyr Leu
50 55 60

Gly Lys Glu Lys Phe Asp Ser Val His Ser Lys Tyr Met Gly Arg Thr
65 70 75 80

Ser Phe Asp Ser Asp Ser Trp Thr Leu Arg Leu His Asn Leu Gln Ile
85 90 95

Lys Asp Lys Gly Leu Tyr Gln Cys Ile Ile His His Lys Lys Pro Thr
100 105 110

Gly Met Ile Arg Ile His Gln Met Asn Ser Glu Leu Ser Val Leu Ala
115 120 125

Asn Phe Ser Gln Pro Glu Ile Val Pro Ile Ser Asn Ile Thr Glu Asn
130 135 140

Val Tyr Ile Asn Leu Thr Cys Ser Ser Ile His Gly Tyr Pro Glu Pro
145 150 155 160

Lys Lys Met Ser Val Leu Leu Arg Thr Lys Asn Ser Thr Ile Glu Tyr
165 170 175

Asp Gly Ile Met Gln Lys Ser Gln Asp Asn Val Thr Glu Leu Tyr Asp
180 185 190

Val Ser Ile Ser Leu Ser Val Ser Phe Pro Asp Val Thr Ser Asn Met
195 200 205

Thr Ile Phe Cys Ile Leu Glu Thr Asp Lys Thr Arg Leu Leu Ser Ser
210 215 220

Pro Phe Ser Ile Glu Leu Glu Asp Pro Gln Pro Pro Pro Asp His Ile
225 230 235 240

Pro Trp Ile Thr Ala Val Leu Pro Thr Val Ile Ile Cys Val Met Val
245 250 255

Phe Cys Leu Ile Leu Trp Lys Trp Lys Lys Lys Lys Arg Pro Arg Asn
260 265 270

Ser Tyr Lys Cys Gly Thr Asn Thr Met Glu Arg Glu Glu Ser Glu Gln
275 280 285

Thr Lys Lys Arg Glu Lys Ile His Ile Pro Glu Arg Ser Asp Glu Ala
290 295 300

Gln Arg Val Phe Lys Ser Ser Lys Thr Ser Ser Cys Asp Lys Ser Asp
305 310 315 320

Thr Cys Phe



<210> 6 
<211> 290 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Arg Ile Phe Ala Val Phe Ile Phe Met Thr Tyr Trp His Leu Leu
1 5 10 15

Asn Ala Phe Thr Val Thr Val Pro Asp Lys Leu Tyr Val Val Glu Tyr
20 25 30

Gly Ser Asn Met Thr Ile Glu Cys Lys Phe Pro Val Glu Lys Gln Leu
35 40 45

Asp Leu Ala Ala Leu Ile Val Tyr Trp Glu Met Glu Asp Lys Asn Ile
50 55 60

Ile Gln Phe Val His Gly Glu Glu Asp Leu Lys Val Gln His Ser Ser
65 70 75 80

Tyr Arg Gln Arg Ala Arg Leu Leu Lys Asp Gln Leu Ser Leu Gly Asn
85 90 95

Ala Ala Leu Gln Ile Thr Asp Val Lys Leu Gln Asp Ala Gly Val Tyr
100 105 110

Arg Cys Met Ile Ser Tyr Gly Gly Ala Asp Tyr Lys Arg Ile Thr Val
115 120 125

Lys Val Asn Ala Pro Tyr Asn Lys Ile Asn Gln Arg Ile Leu Val Val
130 135 140

Asp Pro Val Thr Ser Glu His Glu Leu Thr Cys Gln Ala Glu Gly Tyr
145 150 155 160

Pro Lys Ala Glu Val Ile Trp Thr Ser Ser Asp His Gln Val Leu Ser
165 170 175

Gly Lys Thr Thr Thr Thr Asn Ser Lys Arg Glu Glu Lys Leu Phe Asn
180 185 190

Val Thr Ser Thr Leu Arg Ile Asn Thr Thr Thr Asn Glu Ile Phe Tyr
195 200 205

Cys Thr Phe Arg Arg Leu Asp Pro Glu Glu Asn His Thr Ala Glu Leu
210 215 220

Val Ile Pro Glu Leu Pro Leu Ala His Pro Pro Asn Glu Arg Thr His
225 230 235 240

Leu Val Ile Leu Gly Ala Ile Leu Leu Cys Leu Gly Val Ala Leu Thr
245 250 255

Phe Ile Phe Arg Leu Arg Lys Gly Arg Met Met Asp Val Lys Lys Cys
260 265 270

Gly Ile Gln Asp Thr Asn Ser Lys Lys Gln Ser Asp Thr His Leu Glu
275 280 285

Glu Thr
290



<210> 7 
<211> 302 
<212> PRT 

<213> Homo sapiens 



<400> 7 

Met Arg Leu Gly Ser Pro Gly Leu Leu Phe Leu Leu Phe Ser Ser Leu
1 5 10 15

Arg Ala Asp Thr Gln Glu Lys Glu Val Arg Ala Met Val Gly Ser Asp
20 25 30

Val Glu Leu Ser Cys Ala Cys Pro Glu Gly Ser Arg Phe Asp Leu Asn
35 40 45

Asp Val Tyr Val Tyr Trp Gln Thr Ser Glu Ser Lys Thr Val Val Thr
50 55 60

Tyr His Ile Pro Gln Asn Ser Ser Leu Glu Asn Val Asp Ser Arg Tyr
65 70 75 80

Arg Asn Arg Ala Leu Met Ser Pro Ala Gly Met Leu Arg Gly Asp Phe
85 90 95

Ser Leu Arg Leu Phe Asn Val Thr Pro Gln Asp Glu Gln Lys Phe His
100 105 110

Cys Leu Val Leu Ser Gln Ser Leu Gly Phe Gln Glu Val Leu Ser Val
115 120 125

Glu Val Thr Leu His Val Ala Ala Asn Phe Ser Val Pro Val Val Ser
130 135 140

Ala Pro His Ser Pro Ser Gln Asp Glu Leu Thr Phe Thr Cys Thr Ser
145 150 155 160

Ile Asn Gly Tyr Pro Arg Pro Asn Val Tyr Trp Ile Asn Lys Thr Asp
165 170 175

Asn Ser Leu Leu Asp Gln Ala Leu Gln Asn Asp Thr Val Phe Leu Asn
180 185 190

Met Arg Gly Leu Tyr Asp Val Val Ser Val Leu Arg Ile Ala Arg Thr
195 200 205

Pro Ser Val Asn Ile Gly Cys Cys Ile Glu Asn Val Leu Leu Gln Gln
210 215 220

Asn Leu Thr Val Gly Ser Gln Thr Gly Asn Asp Ile Gly Glu Arg Asp
225 230 235 240

Lys Ile Thr Glu Asn Pro Val Ser Thr Gly Glu Lys Asn Ala Ala Thr
245 250 255

Trp Ser Ile Leu Ala Val Leu Cys Leu Leu Val Val Val Ala Val Ala
260 265 270

Ile Gly Trp Val Cys Arg Asp Arg Cys Leu Gln His Ser Tyr Ala Gly
275 280 285

Ala Trp Ala Val Ser Pro Glu Thr Glu Leu Thr Gly His Val
290 295 300 



<210> 8 
<211> 316 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (233) 

<223> "Xaa" can be any naturally-occurring amino acid 
<400> 8 

Met Leu Arg Arg Arg Gly Ser Pro Gly Met Gly Val His Val Gly Ala 
1 5 10 15 

Ala Leu Gly Ala Leu Trp Phe Cys Leu Thr Gly Ala Leu Glu Val Gln
20 25 30

Val Pro Glu Asp Pro Val Val Ala Leu Val Gly Thr Asp Ala Thr Leu
35 40 45

Cys Cys Ser Phe Ser Pro Glu Pro Gly Phe Ser Leu Ala Gln Leu Asn
50 55 60

Leu Ile Trp Gln Leu Thr Asp Thr Lys Gln Leu Val His Ser Phe Ala
65 70 75 80

Glu Gly Gln Asp Gln Gly Ser Ala Tyr Ala Asn Arg Thr Ala Leu Phe
85 90 95

Pro Asp Leu Leu Ala Gln Gly Asn Ala Ser Leu Arg Leu Gln Arg Val
100 105 110

Arg Val Ala Asp Glu Gly Ser Phe Thr Cys Phe Val Ser Ile Arg Asp
115 120 125

Phe Gly Ser Ala Ala Val Ser Leu Gln Val Ala Ala Pro Tyr Ser Lys
130 135 140

Pro Ser Met Thr Leu Glu Pro Asn Lys Asp Leu Arg Pro Gly Asp Thr
145 150 155 160

Val Thr Ile Thr Cys Ser Ser Tyr Gln Gly Tyr Pro Glu Ala Glu Val
165 170 175

Phe Trp Gln Asp Gly Gln Gly Val Pro Leu Thr Gly Asn Val Thr Thr
180 185 190

Ser Gln Met Ala Asn Glu Gln Gly Leu Phe Asp Val His Ser Val Leu
195 200 205

Arg Val Val Leu Gly Ala Asn Gly Thr Tyr Ser Cys Leu Val Arg Asn
210 215 220

Pro Val Leu Gln Gln Asp Ala His Xaa Ser Val Thr Ile Thr Gly Gln
225 230 235 240

Pro Met Thr Phe Pro Pro Glu Ala Leu Trp Val Thr Val Gly Leu Ser
245 250 255

Val Cys Leu Ile Ala Leu Leu Val Ala Leu Ala Phe Val Cys Trp Arg
260 265 270

Lys Ile Lys Gln Ser Cys Glu Glu Glu Asn Ala Gly Ala Glu Asp Gln
275 280 285

Asp Gly Glu Gly Glu Gly Ser Lys Thr Ala Leu Gln Pro Leu Lys His
290 295 300

Ser Asp Ser Lys Glu Asp Asp Gly Gln Glu Ile Ala
305 310 315 



<210> 9 
<211> 276 
<212> PRT 

<213> Homo sapiens 
<400> 9
Met Glu Ser Ala Ala Ala Leu His Phe Ser Arg Pro Ala Ser Leu Leu
1 5 10 15

Leu Leu Leu Leu Ser Leu Cys Ala Leu Val Ser Ala Gln Phe Ile Val
20 25 30

Val Gly Pro Thr Asp Pro Ile Leu Ala Thr Val Gly Glu Asn Thr Thr
35 40 45

Leu Arg Cys His Leu Ser Pro Glu Lys Asn Ala Glu Asp Met Glu Val
50 55 60

Arg Trp Phe Arg Ser Gln Phe Ser Pro Ala Val Phe Val Tyr Lys Gly
65 70 75 80

Gly Arg Glu Arg Thr Glu Glu Gln Met Glu Glu Tyr Arg Gly Arg Thr
85 90 95

Thr Phe Val Ser Lys Asp Ile Ser Arg Gly Ser Val Ala Leu Val Ile
100 105 110

His Asn Ile Thr Ala Gln Glu Asn Gly Thr Tyr Arg Cys Tyr Phe Gln
115 120 125

Glu Gly Arg Ser Tyr Asp Glu Ala Ile Leu His Leu Val Val Ala Gly
130 135 140

Leu Gly Ser Lys Pro Leu Ile Ser Met Arg Gly His Glu Asp Gly Gly
145 150 155 160

Ile Arg Leu Glu Cys Ile Ser Arg Gly Trp Tyr Pro Lys Pro Leu Thr
165 170 175

Val Trp Arg Asp Pro Tyr Gly Gly Val Ala Pro Ala Leu Lys Glu Val
180 185 190

Ser Met Pro Asp Ala Asp Gly Leu Phe Met Val Thr Thr Ala Val Ile
195 200 205

Ile Arg Asp Lys Ser Val Arg Asn Met Ser Cys Ser Ile Asn Asn Thr
210 215 220

Leu Leu Gly Gln Lys Lys Glu Ser Val Ile Phe Ile Pro Glu Ser Phe
225 230 235 240

Met Pro Ser Val Ser Pro Cys Ala Val Ala Leu Pro Ile Ile Val Val
245 250 255

Ile Leu Met Ile Pro Ile Ala Val Cys Ile Tyr Trp Ile Asn Lys Leu
260 265 270

Gln Lys Glu Lys
275 
<210> 10
<211> 523 
<212> PRT 

<213> Homo sapiens 

<400> 10
Met Glu Pro Ala Ala Ala Leu His Phe Ser Leu Pro Ala Ser Leu Leu
1 5 10 15

Leu Leu Leu Leu Leu Leu Leu Leu Ser Leu Cys Ala Leu Val Ser Ala
20 25 30

Gln Phe Thr Val Val Gly Pro Ala Asn Pro Ile Leu Ala Met Val Gly
35 40 45

Glu Asn Thr Thr Leu Arg Cys His Leu Ser Pro Glu Lys Asn Ala Glu
50 55 60

Asp Met Glu Val Arg Trp Phe Arg Ser Gln Phe Ser Pro Ala Val Phe
65 70 75 80

Val Tyr Lys Gly Gly Arg Glu Arg Thr Glu Glu Gln Met Glu Glu Tyr
85 90 95

Arg Gly Arg Ile Thr Phe Val Ser Lys Asp Ile Asn Arg Gly Ser Val
100 105 110

Ala Leu Val Ile His Asn Val Thr Ala Gln Glu Asn Gly Ile Tyr Arg
115 120 125

Cys Tyr Phe Gln Glu Gly Arg Ser Tyr Asp Glu Ala Ile Leu Arg Leu
130 135 140

Val Val Ala Gly Leu Gly Ser Lys Pro Leu Ile Glu Ile Lys Ala Gln
145 150 155 160

Glu Asp Gly Ser Ile Trp Leu Glu Cys Ile Ser Gly Gly Trp Tyr Pro
165 170 175

Glu Pro Leu Thr Val Trp Arg Asp Pro Tyr Gly Glu Val Val Pro Ala
180 185 190

Leu Lys Glu Val Ser Ile Ala Asp Ala Asp Gly Leu Phe Met Val Thr
195 200 205

Thr Ala Val Ile Ile Arg Asp Lys Tyr Val Arg Asn Val Ser Cys Ser
210 215 220

Val Asn Asn Thr Leu Leu Gly Gln Glu Lys Glu Thr Val Ile Phe Ile
225 230 235 240

Pro Glu Ser Phe Met Pro Ser Ala Ser Pro Trp Met Val Ala Leu Ala
245 250 255

Val Ile Leu Thr Ala Ser Pro Trp Met Val Ser Met Thr Val Ile Leu
260 265 270

Ala Val Phe Ile Ile Phe Met Ala Val Ser Ile Cys Cys Ile Lys Lys
275 280 285

Leu Gln Arg Glu Lys Lys Ile Leu Ser Gly Glu Lys Lys Val Glu Gln
290 295 300

Glu Glu Lys Glu Ile Ala Gln Gln Leu Gln Glu Glu Leu Arg Trp Arg
305 310 315 320

Arg Thr Phe Leu His Ala Ala Asp Val Val Leu Asp Pro Asp Thr Ala
325 330 335

His Pro Glu Leu Phe Leu Ser Glu Asp Arg Arg Ser Val Arg Arg Gly
340 345 350

Pro Tyr Arg Gln Arg Val Pro Asp Asn Pro Glu Arg Phe Asp Ser Gln
355 360 365

Pro Cys Val Leu Gly Trp Glu Ser Phe Ala Ser Gly Lys His Tyr Trp
370 375 380

Glu Val Glu Val Glu Asn Val Met Val Trp Thr Val Gly Val Cys Arg
385 390 395 400

His Ser Val Glu Arg Lys Gly Glu Val Leu Leu Ile Pro Gln Asn Gly
405 410 415

Phe Trp Thr Leu Glu Met Phe Gly Asn Gln Tyr Arg Ala Leu Ser Ser
420 425 430

Pro Glu Arg Ile Leu Pro Leu Lys Glu Ser Leu Cys Arg Val Gly Val
435 440 445

Phe Leu Asp Tyr Glu Ala Gly Asp Val Ser Phe Tyr Asn Met Arg Asp
450 455 460

Arg Ser His Ile Tyr Thr Cys Pro Arg Ser Ala Phe Thr Val Pro Val
465 470 475 480

Arg Pro Phe Phe Arg Leu Gly Ser Asp Asp Ser Pro Ile Phe Ile Cys
485 490 495

Pro Ala Leu Thr Gly Ala Ser Gly Val Met Val Pro Glu Glu Gly Leu
500 505 510

Lys Leu His Arg Val Gly Thr His Gln Ser Leu
515 520 



<210> 11 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Phe His Val Ser Leu Leu Leu Val Gln Leu Leu Thr Pro Cys Ser Ala
1 5 10 15

Gln Phe Ser Val Leu Gly Pro Ser Gly Pro Ile Leu Ala Met Val Gly
20 25 30

Glu Asp Ala Asp Leu Pro Cys His Leu Phe Pro Thr Met Ser Ala Glu
35 40 45

Thr Met Glu Leu Lys Trp Val Ser Ser Ser Leu Arg Gln Val Val Asn
50 55 60

Val Tyr Ala Asp Gly Lys Glu Val Glu Asp Arg Gln Ser Ala Pro Tyr
65 70 75 80

Arg Gly Arg Thr Ser Ile Leu Arg Asp Gly Ile Thr Ala Gly Lys Ala
85 90 95

Ala Leu Arg Ile His Asn Val Thr Ala Ser Asp Ser Gly Lys Tyr Leu
100 105 110

Cys Tyr Phe Gln Asp Gly Asp Phe Tyr Glu Lys Ala Leu Val Glu Leu
115 120 125

Lys Val Ala Ala Leu Gly Ser Asn Leu His Val Glu Val Lys Gly Tyr
130 135 140

Glu Asp Gly Gly Ile His Leu Glu Cys Arg Ser Thr Gly Trp Tyr Pro
145 150 155 160

Gln Pro Gln Ile Gln Trp Ser Asn Ala Lys Gly Glu Asn Ile Pro Ala
165 170 175

Val Glu Ala Pro Val Val Ala Asp Gly Val Gly Leu Tyr Glu Val Ala
180 185 190

Ala Ser Val Ile Met Arg Gly Gly Ser Gly Glu Gly Val Ser Cys Ile
195 200 205

Ile Arg Asn Ser Leu Leu Gly Leu Glu Lys Thr Ala Ser Ile Ser Ile
210 215 220

Ala Asp Pro Phe Phe Arg Ser Ala Gln Pro Trp Ile Ala Ala Leu Ala
225 230 235 240

Gly Thr Leu Pro Ile Leu Leu Leu Leu Leu Ala Gly Ala Ser Tyr Phe
245 250 255

Leu Trp Arg Gln Gln Lys Glu
260 



<210> 12 
<211> 584 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Lys Met Ala Ser Ser Leu Ala Phe Leu Leu Leu Asn Phe His Val
1 5 10 15

Ser Leu Phe Leu Val Gln Leu Leu Thr Pro Cys Ser Ala Gln Phe Ser
20 25 30

Val Leu Gly Pro Ser Gly Pro Ile Leu Ala Met Val Gly Glu Asp Ala
35 40 45

Asp Leu Pro Cys His Leu Phe Pro Thr Met Ser Ala Glu Thr Met Glu
50 55 60

Leu Arg Trp Val Ser Ser Ser Leu Arg Gln Val Val Asn Val Tyr Ala
65 70 75 80

Asp Gly Lys Glu Val Glu Asp Arg Gln Ser Ala Pro Tyr Arg Gly Arg
85 90 95

Thr Ser Ile Leu Arg Asp Gly Ile Thr Ala Gly Lys Ala Ala Leu Arg
100 105 110

Ile His Asn Val Thr Ala Ser Asp Ser Gly Lys Tyr Leu Cys Tyr Phe
115 120 125

Gln Asp Gly Asp Phe Tyr Glu Lys Ala Leu Val Glu Leu Lys Val Ala
130 135 140

Ala Leu Gly Ser Asp Leu His Ile Glu Val Lys Gly Tyr Glu Asp Gly
145 150 155 160

Gly Ile His Leu Glu Cys Arg Ser Thr Gly Trp Tyr Pro Gln Pro Gln
165 170 175

Ile Lys Trp Ser Asp Thr Lys Gly Glu Asn Ile Pro Ala Val Glu Ala
180 185 190

Pro Val Val Ala Asp Gly Val Gly Leu Tyr Ala Val Ala Ala Ser Val
195 200 205

Ile Met Arg Gly Ser Ser Gly Gly Gly Val Ser Cys Ile Ile Arg Asn
210 215 220

Ser Leu Leu Gly Leu Glu Lys Thr Ala Ser Ile Ser Ile Ala Asp Pro
225 230 235 240

Phe Phe Arg Ser Ala Gln Pro Trp Ile Ala Ala Leu Ala Gly Thr Leu
245 250 255

Pro Ile Ser Leu Leu Leu Leu Ala Gly Ala Ser Tyr Phe Leu Trp Arg
260 265 270

Gln Gln Lys Glu Lys Ile Ala Leu Ser Arg Glu Thr Glu Arg Glu Arg
275 280 285

Glu Met Lys Glu Met Gly Tyr Ala Ala Thr Glu Gln Glu Ile Ser Leu
290 295 300

Arg Glu Lys Leu Gln Glu Glu Leu Lys Trp Arg Lys Ile Gln Tyr Met
305 310 315 320

Ala Arg Gly Glu Lys Ser Leu Ala Tyr His Glu Trp Lys Met Ala Leu
325 330 335

Phe Lys Pro Ala Asp Val Ile Leu Asp Pro Asp Thr Ala Asn Ala Ile
340 345 350

Leu Leu Val Ser Glu Asp Gln Arg Ser Val Gln Arg Ala Glu Glu Pro
355 360 365

Arg Asp Leu Pro Asp Asn Pro Glu Arg Phe Glu Trp Arg Tyr Cys Val
370 375 380

Leu Gly Cys Glu Asn Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu
385 390 395 400

Val Gly Asp Arg Lys Glu Trp His Ile Gly Val Cys Ser Lys Asn Val
405 410 415

Glu Arg Lys Lys Gly Trp Val Lys Met Thr Pro Glu Asn Gly Tyr Trp
420 425 430

Thr Met Gly Leu Thr Asp Gly Asn Lys Tyr Arg Ala Leu Thr Glu Pro
435 440 445

Arg Thr Asn Leu Lys Leu Pro Glu Pro Pro Arg Lys Val Gly Ile Phe
450 455 460

Leu Asp Tyr Glu Thr Gly Glu Ile Ser Phe Tyr Asn Ala Thr Asp Gly
465 470 475 480

Ser His Ile Tyr Thr Phe Pro His Ala Ser Phe Ser Glu Pro Leu Tyr
485 490 495

Pro Val Phe Arg Ile Leu Thr Leu Glu Pro Thr Ala Leu Thr Ile Cys
500 505 510

Pro Ile Pro Lys Glu Val Glu Ser Ser Pro Asp Pro Asp Leu Val Pro
515 520 525

Asp His Ser Leu Glu Thr Pro Leu Thr Pro Gly Leu Ala Asn Glu Ser
530 535 540

Gly Glu Pro Gln Ala Glu Val Thr Ser Leu Leu Leu Pro Ala His Pro
545 550 555 560

Gly Ala Glu Val Ser Pro Ser Ala Thr Thr Asn Gln Asn His Lys Leu
565 570 575

Gln Ala Arg Thr Glu Ala Leu Tyr
580 



<210> 13 
<211> 526 
<212> PRT 
<213> Homo sapiens
<400> 13

Met Ala Val Phe Pro Ser Ser Gly Leu Pro Arg Cys Leu Leu Thr Leu
1 5 10 15

Ile Leu Leu Gln Leu Pro Lys Leu Asp Ser Ala Pro Phe Asp Val Ile
20 25 30

Gly Pro Pro Glu Pro Ile Leu Ala Val Val Gly Glu Asp Ala Glu Leu
35 40 45

Pro Cys Arg Leu Ser Pro Asn Ala Ser Ala Glu His Leu Glu Leu Arg
50 55 60

Trp Phe Arg Lys Lys Val Ser Pro Ala Val Leu Val His Arg Asp Gly
65 70 75 80

Arg Glu Gln Glu Ala Glu Gln Met Pro Glu Tyr Arg Gly Arg Ala Thr
85 90 95

Leu Val Gln Asp Gly Ile Ala Lys Gly Arg Val Ala Leu Arg Ile Arg
100 105 110

Gly Val Arg Val Ser Asp Asp Gly Glu Tyr Thr Cys Phe Phe Arg Glu
115 120 125

Asp Gly Ser Tyr Glu Glu Ala Leu Val His Leu Lys Val Ala Ala Leu
130 135 140

Gly Ser Asp Pro His Ile Ser Met Gln Val Gln Glu Asn Gly Glu Ile
145 150 155 160

Cys Leu Glu Cys Thr Ser Val Gly Trp Tyr Pro Glu Pro Gln Val Gln
165 170 175

Trp Arg Thr Ser Lys Gly Glu Lys Phe Pro Ser Thr Ser Glu Ser Arg
180 185 190

Asn Pro Asp Glu Glu Gly Leu Phe Thr Val Ala Ala Ser Val Ile Ile
195 200 205

Arg Asp Thr Ser Thr Lys Asn Val Ser Cys Tyr Ile Gln Asn Leu Leu
210 215 220

Leu Gly Gln Glu Lys Lys Val Glu Ile Ser Ile Pro Ala Ser Ser Leu
225 230 235 240

Pro Arg Leu Thr Pro Trp Ile Val Ala Val Ala Val Ile Leu Met Val
245 250 255

Leu Gly Leu Leu Thr Ile Gly Ser Ile Phe Phe Thr Trp Arg Leu Tyr
260 265 270

Asn Glu Arg Pro Arg Glu Arg Arg Asn Glu Phe Ser Ser Lys Glu Arg
275 280 285

Leu Leu Glu Glu Leu Lys Trp Lys Lys Ala Thr Leu His Ala Val Asp
290 295 300

Val Thr Leu Asp Pro Asp Thr Ala His Pro His Leu Phe Leu Tyr Glu
305 310 315 320

Asp Ser Lys Ser Val Arg Leu Glu Asp Ser Arg Gln Lys Leu Pro Glu
325 330 335

Lys Thr Glu Arg Phe Asp Ser Trp Pro Cys Val Leu Gly Arg Glu Thr
340 345 350

Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu Val Gly Asp Arg Thr
355 360 365

Asp Trp Ala Ile Gly Val Cys Arg Glu Asn Val Met Lys Lys Gly Phe
370 375 380

Asp Pro Met Thr Pro Glu Asn Gly Phe Trp Ala Val Glu Leu Tyr Gly
385 390 395 400

Asn Gly Tyr Trp Ala Leu Thr Pro Leu Arg Thr Pro Leu Pro Leu Ala
405 410 415

Gly Pro Pro Arg Arg Val Gly Ile Phe Leu Asp Tyr Glu Ser Gly Asp
420 425 430

Ile Ser Phe Tyr Asn Met Asn Asp Gly Ser Asp Ile Tyr Thr Phe Ser
435 440 445

Asn Val Thr Phe Ser Gly Pro Leu Arg Pro Phe Phe Cys Leu Trp Ser
450 455 460

Ser Gly Lys Lys Pro Leu Thr Ile Cys Pro Ile Ala Asp Gly Pro Glu
465 470 475 480

Arg Val Thr Val Ile Ala Asn Ala Gln Asp Leu Ser Lys Glu Ile Pro
485 490 495

Leu Ser Pro Met Gly Glu Glu Ser Ala Pro Arg Asp Ala Asp Thr Leu
500 505 510

His Ser Lys Leu Ile Pro Thr Gln Pro Ser Gln Gly Ala Pro
515 520 525 



<210> 14 

<211> 7819 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 

<222> (4599)..(4652)

<223> coding portion of exon 1 

<220> 
<221> unsure

<222> (145)..(292)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (1977)..(2459)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (2465)..(2576)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (2593)..(3410)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (3490)..(3533)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (6111)..(6194)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (6836)..(6870)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (7091)..(7112)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (7118)..(7420)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (7554)..(7575)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (7694)..(7718)

<223> "N" can be A, C, T, or G

<400> 14 

ataagaagct gaattaaggt gatggcagtg gggtggaaga aaggagagcc accatgcaaa 60
aagtatccag gagggagaat taacaggact aggggatggg ccatatttgc aagatgagaa 120
atgcagaggt ctaagattct agctnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcataaac 300 
catattttcc ccagaggagg attagtagga aaggaagctg ctggttggaa agtatcttta 360 
tagcagtgtc tgttcctcgg tttgctcaag gggacagtgt gccaggaaag tccccgtgga 420
agggcaagga agaaggggaa gttaaagcca gtggcaggtg atccaagaat cttttctgtt 480
gctagagcta tgttacatgc tgtcctttca tgctctaaaa ataagagtgc tggcaagtgc 540 
caggcctgtt ggtgcagctt aagatgatac ctttcttgga tatatatgca tctgaataag 600 
gaaggctatc ttctggtcaa gctaaggtat gccatgagca tttccctgtg gaaagcactt 660 
aattctgttc ccagttgtta cctgctgtaa gatctccctt tctaaaataa aaacaagaat 720 
acagctcact gaggacctta catttccctc tagctactga ctcatttctc ttctcctttt 780 
tatagcactc ttcttgagag agttgcctat atttgttgcc acatctttac ccattctctt 840 
ttgaacctat tcaagctttc atctgtacaa aactcactga tactgtgctt gtcaggatca 900 
tccatgacct ccatactgct aaatgcaact ctcaagagta tttggctcta ctgatcactc 960 
ctttgtagca ctgtgtttta aaatataggt tttattatta tttaggtatg gtgaggccaa 1020 
tatatcagga aatgactgtc gttgaaaaaa gtatgttgta ctcacagatc ccaagagaag 1080 
gggggcacac catgccacaa agggccacat ggggaagcac cagggtcagc caggaggtgg 1140 
gtggggggtg cgcaagatct ttattgtggt ttcaacagga agaaatgggt gaagcagggt 1200 
gagtggattt aggattagct gatataaata atttcagcag gctctggggc ataggggctg 1260 
tccctagtct tctggtactt ggccctgggg tgattaaggc agttgcatag tgttgggaat 1320 
gtgaaagccc ccaataaatg aggcagttgt gggtatgggc tctgaaatgg gttggtttgc 1380 
atttgaaagg tgtgctcatg ggcaagtggt ttactctctc ttagaggtta gaattggcta 1440 
accctgggag cggcagtccc ttcagggtca gcaaggcccc aggtgtcaaa gcatcagaat 1500 
acagaaaata aaatgcatgg ataatacaca ctgccatttg cctttgtacc cttcctttca 1560 
atcttctctg ctggtgaccg ctcttcacaa agatctataa atgttggaat accccatgtc 1620 
tcagtccttg ggcactctct ttcctatctc tctgtaggtg atgtaatgca gatatccatg 1680 
actttaaatc tttaacactt ctgcattgat gactcctaaa tttacatctc taccccaact 1740 
gcctactaaa cacctccact tggctatcta ataggcattt caaaccaaat ctacaacaaa 1800 
cgtaactctt tttccccttc cttaatttgc ttctccccca gccttctcca ttttaataaa 1860
cagcatctcc attgccttag tgactcaagc cccaaactta ggaattttcc cagatttccc 1920
tctttttctc aaactatata tctagcctgt cagcagttcc cttcaggtct tttttcnnnn 1980
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2040
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2280
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2400
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna 2460
ctaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2520
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctc 2580
catttatatt tannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2640
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2700
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2760
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2820
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2880
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2940
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3000
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3060
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3120
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3360
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgtgtttta 3420
aatatatata cacacttaga cacatataac cctctttcgt atatcaatta tactttaata 3480
aagctgttgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnctcgcca 3540
acagaccact tctcacccct actagcccct ctactctaag ctatcagcat ttctgcaaac 3600 
acattcctaa ccgatctcac tgcttgtaat cttgccagca acctctccct ctcagcaata 3660 
gtctattgcc tacaccaaag cttagttgtc tcttaatgat gtaaatgagg ttctatcatt 3720 
ctcctgaccc aaaccctcca ctgcttttca tcacactcag agcagctctg ctgttgcctg 3780 
atttagatgt atggctccaa cagatttccc ctgaagaaat gattccatgg ctgataaaag 3840 
ttggaaagcc tcctcagttt cagaccatta tcagattagc tgtgtgctct gtccctttcc 3900 
tcaaccataa gaagtccatg gataaagaaa gcttcagagt aaaggagaaa gcatgggagg 3960 
tacagcagga ccaaggtggg gcattcgcag cccccaccct catcagagcc agttccctac 4020
tctccctgtc taaacctctt agtaagaggt agttcaagag aggggcaaac tcaattccag 4080
cactcaaaag cacttgacta ctttgctcag tcaactagca agtatttatt gagaatgtag 4140 
ctctgttcta tggagtctta ttttcaagtg tcagactccc agacatccag tccaggtaaa 4200
gaagatggtg tccattattc atttgacaaa caaagttggg gttcaagggc cagctattga 4260 
aaaaagctat ggaaagcttc atgagacgtg caggtaactg ccaatatgtg tggttcacaa 4320 
ggactggttc atattcagaa acggccatta gaaaaggaag aagaacttct catttggatt 4380 
tataaagagt gtcttgttta ctcttaattt atatcttctc ttctccagga aatcaaccta 4440 
taacttctcc tcccagctcc actctaccat ggtctgtcac cttccccaaa tgatttgtta 4500 
ttcccctgtt ttcaaaagtg aacaaagaac caaagaccca gcaaagtttc acaaggccct 4560 
gagactttca attgtctatt tcagatcaaa tacagaac atg atc ttc ctc ctg cta 4616
atg ttg agc ctg gaa ttg cag ctt cac cag ata gca ggtaagaaag 4662

gacaaaggga gaggcttaag aaagaagagc aggtggtggt tcctagccaa agccaaaaat 4722
gagaatgtgg ccctcaggct gagggctttc tttgagagga cgtatgattt ctgggctatt 4782
ccaagcacca caaaaaaaaa aagagtcccc atggtggctt atacatgcca atgtccctat 4842
ctgacagaaa cggtgactga gaatattgct ccatctattc ccactatcca gtgagggtaa 4902
tgacaagaag acaggatcac tcagaccatg taaatctaaa ctgatacaag agggcagggg 4962
ttgagttccc ttaaaggtga gatgccaagc agctgtcccc ttcctttctg gcagggagag 5022
taaggagaca atggccaggg aacaccgtta ctctaaagat aatgtcttga agacattctg 5082
catattatta gttgtttctg tgagtttctt ttttgaaaag caacaatagc agccgttggt 5142 
cattcatacc ttaatgtggt ttactgagtc ttcctaaaac ccaaatgaac aatgaacctt 5202 



aaggctatcc ctttggactt gaagaaagga cttctattgg aggatgaggg tgagcagaaa 5262 
gaaaagcagt ttcacagttg gttgttctcc tggggaaggt agttcagacc attcgagggt 5322 
gtagttagaa ccatgagtgc actattttgg atgaacacca ggagctaaga gagtaacata 5382 
gaggtgtgga cagaggatta agtcctcaag acaatagccc cagccccatg ggaaatcatc 5442 
tttctgctca tgattgagaa ataatggctc ccttggcact tgataacctt tcgaagagct 5502 
ttctcctccc tactagctgg ttccagatca ctcttcaccc agtcacattc ctctcactca 5562 
cttgagctgc ccagcctggt ctggcactag agacatgcac ttggggccct cctcaaagga 5622 
agaccctgag atattctgct tacttctact ctgctcctgc ctgcagggcc agctaaagga 5682 
acttttcatg ttttctttgc aaggaaccct gcctggctgg cattttagag acaagcaaaa 5742 
ggggcaataa cttccttgct acaaaacagc ttcaagtttc catagagtga taagggaaat 5802 
gagggccaaa agacactgtt ccccatcctg tggcaggact gggggcttca ggagaaaact 5862 
tggggaatgt gtaacctctg tgggtttgta gcttaaaaac actgagatcc tgggttttct 5922 
gtctttgttt tttgcctttt ctcttaggaa aggagtgagc tagggtgaca aggggcaaca 5982 
ttttttatcc ctcattggct ctttctacag aggaaggatc ttttcttcta agataatcag 6042 
cacaagacaa tgaagatagg cactagctcc cagttaggta tactaatggg gcaaaaggaa 6102 
gagcatttnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6162 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnattttgca ggtgagggaa ctagaactca 6222 
gaaaaggtac ttaatttccc caagattaca tagttattag gtgacaccgg caagatttca 6282 
acaaagctaa tgtcctttct actttactgt gctaccatga tgatggtaat caaaaatggc 6342 
agacaaccca taaatcttcc aactttggaa taggtttttg cactgaagtc tgaatatgga 6402 
tacgtattga atgtttattc tggatattca cagaatcaaa aaatatgtgt aatgaattat 6462
gttgctgaat taactgaaag gaaagtaaaa atgtagcgct ttctcatttt cttcacgaat 6522 
ttggaattct tttctgcttt ccactatgca gataacatca gttcagacaa atattaaata 6582 
cctacctaaa ttagaatgcc ttctcctcat gggatttttt taaaatcttg tcatttcatg 6642 
tctctttaat taaagagttt tgatttcaga ggagggtacc tgcaaaagaa aacaacaaaa 6702 
aaactaaagg atctgagaaa taattagtgt ttacttctgg ggaggggagg aggtctggga 6762 
tgggggtaaa aaggatagtc ttatctatta tgtatattca ggtttttgtt ttttacaaga 6822 
agcatgtatt aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnca agaaatttct 6882 
catataaaaa tatgaaagta atcagactgc aacactcagt gcctgagaca gagctacagc 6942
tatcagggtg tccagacaga cagaagatta cattttcttc cttgctcctt gtacagcccc 7002 
agacctgcat gcttcattga aaagaaaaga agatacctga attaaatcaa tgtgatgctt 7062 
agtaccctat cagtgcacat ttcttttcnn nnnnnnnnnn nnnnnnnnnn cacttnnnnn 7122 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7182 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7242 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7302 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7362 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnca 7422 
cttatggcgg tattctcagt cattacaaat aaataaaaac aatccatatg ccctggagaa 7482 
tttgattcca ggagtaggtc tagaagaact tcaactggag aatggataga gaaatcatgg 7542 
tatatttgca gnnnnnnnnn nnnnnnnnnn nnngatagca tgtgaataaa ttaattacaa 7602
aaacatatga ctacatctat tattatatag catgtagata aattacaaaa acatgtaact 7662
acatctatga atcttagagc ataatattga gnnnnnnnnn nnnnnnnnnn nnnnnnggcc 7722 
agaagataac acatagcaca atgtcctttt cataaataaa tatattgctt aagcatacct 7782 
tatatataga agataaagct taaaaagtaa agaagag 7819



<210> 15 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> (20) . . (175) 

<223> "N" can be A, C, T, or G 

attatcactt atgagggtgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 60 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnctgcc 180
ctttattttt tcatgaaaga aattgctgaa gaggactaaa agaagtttta gtaagcattc 240 
aataaatgta tgttctttat agtttccaaa tcagcaaata tagacatcct gcatttttaa 300 
ggagatttat atattttatt ggacatgctg taatttattt aaccacttcc ctgttggtag 360 
acattatttc cattttcttc tgctagatta atgcttgaaa aaaatgtgtg cctcctaaag 420 



actgtgatga aagttgcctc tgaataaaac tcaaacaaat cattaatcat taactctttc 480 
cttacttgta tgctctttgg atgctctact gtgttatcta taaaataaag tttgaagtga 540 
aaaattaggg taaaacattt tatatcattt ttaaaggata tatacatgga tgtacttaca 600 
tatgcatgtt taaatttata taccataaca tttatttctt tttttaaaaa 650 



<210> 16 

<211> 9179 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 

<222> (5211) . . (5516) 

<223> exon 2 

<220> 

<221> unsure 

<222> (1054) . . (1198) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1297) . . (1430) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1482) . . (1658) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (3286) . . (3529) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (3580) . . (3620) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (4193) . . (4416)

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (4506) . . (4682) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 
<222> (6166) . . (6223)

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (6635) . . (6803) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (7946) . . (8351) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (8578) . . (8609) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 
<222> (8870) . . (9178) 



<223> "N" can be A, C, T, or G










<400> 16 
tatatttgtt ttttttctta catttttatt tcaaaatcta aggacatctt ataacccaga 60
aatatttttt ataccttgtc atgtcttaga ggaaagagcc accccagtct tttttcattg 120
atgtttttct tctctcttcg tactccagag gtagatgaaa accagagggc cacaatgacc 180
atggtgatgc ctgaggtcat tctggggcac agacctcagc ctaggttact ccacttcgcc 240
tatctttaga tccaaaacta ccctgctgac tgctgagata aacaaaggag aataatcagg 300
ttggggaaag gatttctatg cgaagacatg tctccatgca gtcctcctac actgagcaga 360
gcatgagtca ggtgcttaga gcaggatttt gtcctaaacc aggaacttca gagttttctg 420
aagaatgtgg ctatgtaaag caccccccca ccccaccctt acttctcaag tacattacgt 480
ggcaagtctg aaaaaactta cacttctgtt gttaaatgtg ggggataaaa tataaactta 540
gtttcaagag gaagctatct tgggaggtaa tgcaaataat tcgttgtgtg tttcctgaat 600
aagtgacagg tgctgactac cattgatgct tcattgcaat aaaatgcaaa gctcccccaa 660
gaatttttga aatgcatcaa gctaggtgtt ctaatctagc aaaaggacct gcatacatga 720
atttttcatg cttttgccaa gtcttttgcc ctttagttta gttaagggcc ccacatgaat 780
ggaaagcctg tgttgtcagc ttaattttgt agttgtggaa accttccagt tttctccttt 840
gtctaatacc ttcaggagtt caatcctagg ttgaagctta atttaataac catgtggcat 900
gtaaagtaga aaacaaaaca tcttttcctt agcatacagc aaaaaaaaaa aaaaaactca 960
ctcatggatg tagtgtacac atgccagtgg atatatagtc ataactgcag tcattggtag 1020
cacagaaata aatgtgcatt gaagacacag agannnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngg 1200 
gctagcccag tgtctgactt actgtgttta agaaatatca actattacgc tacttcccag 1260 
tgacagtcca aatgcagacc agtgttataa ctctacnnnn nnnnnnnnnn nnnnnnnnnn 1320 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aggcaaggtg 1440 
gtaaaagacg cttgcacgtg caaatgttac tttgtgtaac tnnnnnnnnn nnnnnnnnnn 1500 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnct tagtgaatct aatttgcagc 1680 
tgggcttgag aaaaaacctc ttcatagaat tgtttgcatc agtgtcttga ttgcctctgt 1740 
aacttacaat aagcaagaat gtttcaggat ttcaaaaatc tattgcattg cctaaacctc 1800 
ttattttgta tggagtaatc aagctcaaag tttgcatgtc ttagaaactt tacttggggc 1860 
aaaattagac caagtaacaa ttaatcttct aggtattctg agctattcag acatatgatt 1920 
catgtttgct aattgctctt ttctcttgta aatattagct gaaaaatgtc acctgtctga 1980 
caagtagcat attttatgcc tatcactcct ggcacgcatt cttacaaggc agacaggaaa 2040 
aataggaaga aaatggactt ttatcaaagg cccaggcagt aaagagggga gttctgctgt 2100 
aagctaaggg gagttccaga ggaagttata ggcgttccct ttcttatgac aagaaagcat 2160 
agtgcagtaa ataaatttgc taaatagatt caacagtctc tacccaaagt catctattta 2220 
attcttgttg ttatgcagac tcagcaacta accttccttg taagccccat tttcttccct 2280 
gtttcctgtt tatcaaatgt aattaaacaa gagaagtatt atagaagagt aaaagtagta 2340
ggtaattctt gaacttggca tatgattact acatatttga tgaatagttg aatattattc 2400 
ttcaaggaca gattggattt ggtatcaggt ggctctgcat taagttataa gggacttaat 2460
aactcaagta tttaaggacg gcttccatca taaagggatc tgcccttaag agggtcccat 2520 
tatggagatt ctgaggtgag agctattcca agtgtgcagt ggattaaaat aaaagaatca 2580 
tacaggaaat ctctttttac atgccttatt ccagggtctt tgcaacctgg cacagcaagt 2640 
gcagatatga ttagcattgt tttacacatg tacactcacc ttatagccct gcccctgtgc 2700 
ccctcctgca caaaagaatg ctgggcacac gtgaactcct ctctgtagaa aggcacatta 2760
atgttctagc catggttaaa acagggatag aggcaagcca aaaatgtcgg tcatttgaaa 2820 
taaatctcaa gtttgtgcat atcactatca agtgtgctgt gtggcaatta agaatgccaa 2880 
tttgtgtgat cacaggcaag ttgcagtttg atgaaaggaa agcagaggtg aatatataac 2940 
cagggtcatc ctttctttct ccctctctct ctttctgtca tttatttgcc aagctcttaa 3000 
ctagaacttg ctatgtgcta ggtactggat atatcaaagc aaactcagcc tggtctttgc 3060 
cttcaaagat ttgcaggata gtgggaagaa aaacttgaat cagaggacat ctgcagtggg 3120 
aatcattcaa gcagcagaaa acccaaaagt tacttatact gtgaaatctg atcagagaat 3180 
ggactgtcct ggttagtaaa atatcctgga ggataaagat tggccatgca ttccacatat 3240 
gaattaccac tttcccaaga attaaaacat ggtacgaaag aaaggnnnnn nnnnnnnnnn 3300 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3360 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3420 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3480
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna aacaacttct 3540
ggaatagcta atgcttagaa gcagctccca aatatttgtn nnnnnnnnnn nnnnnnnnnn 3600 
nnnnnnnnnn nnnnnnnnnn gcaagctcta ctgaacataa tttgatctaa tcttctgtga 3660 
ttattcagaa actacttcaa gattttccta tacctccatc ataatgaata cccattcatt 3720 
aatgatggaa gcagcctaat tttgtcattt ttcacacttt attgatgtaa cactaccttt 3780 
actagtttgg ccactcctta tgcttttttt atagaactat ttagatcaat tcaactttta 3840 
aaaaataaag ccacataccc ctgtggtaga tgaaaaacaa gtatcatttg cactggtaaa 3900 
tagagaatag gaagaaaaat aaatgcagtg aaaataaagc agtgttatca aatcctaccc 3960 
agatactgtt atctacccgg aagcttcctg tttgattaaa aggaaaaata gccagtgtta 4020 
gaggtgtgga agtctagttg aaattatatg caattgaagg attaaaatag aattgaaaag 4080 
ggaataaatt cctctctgaa taatttaact ccctttaggc tttgattctg cctcatctaa 4140 
aatcatctta catacttcta gtggcgtgtc cctcacattt tggtaaactc tgnnnnnnnn 4200 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4320
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4380
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnccat ttttcctttc cctttattgt 4440



cagaaaatag aaagcatcta cagtgggctt gtatgatgtg gtggttagaa atacctgatc 4500 
tgattnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4560 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4620 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4680
nnaccatcac tggtgagagg aagtgatacc tggcacaaaa atatatggat taatcaatat 4740 
ggattgaggg aaacaaacct ggagaatagg atgtgaaggt atttaagtaa catgagctca 4800 
gaccttgatg gtagggaagt cgaaaggaag cattttgttc ttatatgaca gatgacctgg 4860 
aatgactgca gggcttgggg ggtcagggac tggaggtggg agaggcctct gagagcaagc 4920
agtgctgtcc accagaagct cttgctgggg tgcccagaga ggagcaaagg gcagtcagct 4980
gcacaggagg gaatgtttgg aggagagagc cacctcagat cagcgggtca agaatcccac 5040 
tcttgcccag atggatgggg caaaggagaa aaaggattcg ccacgggaat gtccagataa 5100 
gacaggtgcc ttttggaaaa tgggggtgag atgggtctca ggttacactt cgtaagaact 5160 
ggaatgtaaa gtaaaggcag acaatgacaa aatatcttgt tttcttttca gct tta 5216
ttc aca gtg aca gtc cct aag gaa ctg tac ata ata gag cat ggc agc 5264
aat gtg acc ctg gaa tgc aac ttt gac act gga agt cat gtg aac ctt 5312
gga gca ata aca gcc agt ttg caa aag gtg gaa aat gat aca tcc cca 5360
cac cgt gaa aga gcc act ttg ctg gag gag cag ctg ccc cta ggg aag 5408
gcc tcg ttc cac ata cct caa gtc caa gtg agg gac gaa gga cag tac 5456
caa tgc ata atc atc tat ggg gtc gcc tgg gac tac aag tac ctg act 5504
ctg aaa gtc aaa ggtgagtggt gtcaaggact agaatccatg gaagctctct 5556
ccaacagagg atctgcaagt cacagaaacc cattaaaggt agctcaagca aaaacaagca 5616
ggctgctttt aaggagacag ctatttcaga gaaaatgaaa gcatctgctc ggaaataatt 5676
tttgacatct gagtacaaag cagccgaagt acaagtgaaa gggggtagga cctataggaa 5736
taaaatggga ctggaggaag ccaggaaaat tagtccctga aatgtgggag ggtatgaaaa 5796 
ataagctttg cctaattcac aattctccca tggaacatcc ctgacttgat tattaagata 5856 
ctctttttca atagtttata ccctgaatcc agagttttta aaaccatggt ttgccgccca 5916 
ttcatggatt aaaatatcaa tttagtgagt agcaaccaga tgcacgtttc ccgcccttta 5976 
aaaaataatg tatagaagag aatagacaga gtagatcaga cgatatcaca gagtaggact 6036 
gagtactgta aaactaattt ctgagggacg tgtgtgtgtg tgtgcgtgtt gggtcatggt 6096
ataaattttt tttttcttac tttggatcat aaaaagttac aagtttggaa aacactgctc 6156 
aaatgcaagn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6216 
nnnnnnnaga gtcccatgaa gtctatagct gtccctattc ctctggattc agggatctct 6276
ccactccagc acaattgaaa atctaaatat aaagagaatc ttcacactct tgtttgttct 6336 
agaaaaggtg atttgaggaa agacatataa caactataaa aaatagattt tgcttgttca 6396 
ttggcttatg gtctccaggc ttgaatgctc tgagataaat gatgccaata tttctctggc 6456 
ctcttcccat cccacgcatt ggacctcaga tggtctgtac tgtcttctag agggtttgtg 6516 
ggttttggcc ccaaaaaacc attaaccttg gcagaaagtg tgtgacttta tgatctggta 6576 
caaagaagga caaactagag ggactggaca tgaggatgaa tattgtgttc gcccttatnn 6636 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6696 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6756
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaaa gaccctcagt 6816
tgttacaggg gcagtgacct cctcacacct caaccatcaa tgagtcacca ggaaagccat 6876
tagcctagat gtaactgttt tctatcttta ttgcatttcc tacatccagg cagcagctgg 6936
gaggaactct agaacactga agtttgtctg agttccctta atgtaaggct gtacattctc 6996
aggatgcctt gatgtactcg aatatctgca accctaaatc accacctctg tttttattga 7056
tctctatctg aatgctgtat taatgggcca ggccttctgc ccattctctc aaactgagaa 7116
ctgtctctca ttcctgggga ggcaccctgc ctactcctta cctagatcag ggatttctca 7176
gttgtggaga gatttgttcc ttatagtgtt ggtcatcaaa ctgggatatt tggggattac 7236 
aaagactttt caagggatgt atgggcacag gcagttttag gaagtgagtt cctagatcct 7296 
catcttcccc aaatactcgt tcccaaaatt gacgagcctg acaatgtgca tgccaggcaa 7356 
ggctcttggg gttcccctaa aacacttcct cttttaagcc taccactcac tcatcatgaa 7416 
tatagtccat tgtcccaggg tgtaaaaccc tctatagtgt taaataaaag aatgattggg 7476 
aacattgaca cctgatggaa ctgttatgac taaaaaccct tttgcaaata atgtggtatc 7536 
taattttctg ctttcaacaa aattgaagga ggcccttata aagttaataa ctgataatca 7596 
aaaatgagta atttttgcca tgtaaatcag gtcaaagaat gaaatggcat tgctgtaacg 7656
aaactgcttc cattcccatt gatttactca tacgaacaag attccttagc ctttataagc 7716 
tacaaaaaaa tgaaaaatag aaatagaatt gaggctgaat tctattatat aaaatcattc 7776 




caaccatgtc atatggttct tcggattcat gaataatttg gaaaagagag ccatatccat 7836
cttattaagg gacacattcc caataaattt tcatctttca tgtttaataa ttatcaatat 7896
tcataacatt ttacattttg atcaaatatg tgttaataat aatagaaatn nnnnnnnnnn 7956
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8016
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8076
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8136
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8196
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8256
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8316
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnattag atcttaatgc agaacaccct 8376
gaacatttaa agcttcatag tcacaagaga aaagttttca tttcaatagc tataaatatt 8436
ttgttgttgt aaagacatat aacgataatc aatacaaaat ctgtcaaaca aaaatatgtt 8496
acattaagat aaaattctgt agggaaggtg aaattggaag tgagtttcaa tgaatgaaaa 8556
gaaacaattt agacagagaa gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnncttagaa 8616
ggaattgaat agattaggtt ccttacccaa aaagcctctg ttatttgtct tatttattta 8676
ttctcttttt tccacattct ccagtctcat tccccttttt taacacagga aattattcca 8736
gcatgtttca tacatattct tttgtttgta agagcttatt taaaatatgt aatattgttt 8796
tagatgcata tatttttttt cttgtggaaa ctatattgta ctatatatat atattttaga 8856
aatggacaca ttannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8916
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8976
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9036
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9096
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9156
nnnnnnnnnn nnnnnnnnnn nnt 9179

<210> 17

<211> 1814

<212> DNA

<213> Homo sapiens

<220>

<221> exon

<222> (738) . . (1010)
<223> exon 3 

<220> 

<221> unsure 

<222> (72) . . (118) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (405) . . (630) 

<223> "N" can be A, C, T, or G 

^agatctcag ctttcttgag gcagggagcc atatctgttt aattcactca gcatatactg 60 

caaagaagca gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnta 120 

gggagaagtg cagaataaat atacccaact ctttactatg tatagacatt atctaggtct 180 

ttattttttt tctcttctta atctcaaaga aaacagagga aaggaggaag taaaaagtaa 240 

atttttgcct gaagatgttt ggaaaaaata ccaaataaag tgagatagtg ggtaatctag 300 

tgatttttat ttttccgtcc tctttctggc ctccaattgt gaaataattt atagcactgt 360 

aagaaagaag ccacaaattg tggtagcttg gaccactgtt gaggnnnnnn nnnnnnnnnn 420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn acttaatatt gatagtgata attttattca 660 

tttcacatgg catgaagtac caagctctat aggaatcaga aaataaagtc ttatttcttt 720 

ttcttctcta ttgtcca gct tcc tac agg aaa ata aac act cac atc cta 770

aag gtt cca gaa aca gat gag gta gag ctc acc tgc cag gct aca ggt 818

tat cct ctg gca gaa gta tcc tgg cca aac gtc agc gtt cct gcc aac 866

acc agc cac tcc agg acc cct gaa ggc ctc tac cag gtc acc agt gtt 914

ctg cgc cta aag cca ccc cct ggc aga aac ttc agc tgt gtg ttc tgg 962

aat act cac gtg agg gaa ctt act ttg gcc agc att gac ctt caa agt 1010
aagagctgcc cccacttcct aggtctatca gttagggttc agacaagaaa cagatggcat 1070
actcgagtga tttgaggaga gtgtaataaa gggactgttt acaaaggtgt gatcaccatt 1130
tggagaaact acaaaggata gtgcagaaca ctggggcttc aatgttggga gggcaattac 1190
cactgttgga gaagttactg gaatcagaag ggagctgtag ggaaagcccc acttcccagg 1250 
agctgtagcc acagaatagg gaagctgcca catgcagcga ctccaaaggg tggaaactgg 1310
atgaatgaat accccaactc attctcctcc caccctccaa tctcctgcta gcacctccca 1370 
ttggctgaac ccagctagaa gtcagagaat acaagggtcc actgttgtat tccataaaag 1430 
tcaacttctc agggctcaga gcaatattga catgtacaga atagatctgg agaggaaaca 1490
gaaaatatct agtacaatag ctaatcactg tgattcatgc acagtgtcat gagccagcag 1550 
gatgaatatt cctttgctgt acttgctgcc agtcagctgg ttatgggttt ttccaagaaa 1610 
tttggtctct aacaaaattc ttcagagcct ttactgacta tgctggatat ttttggaagg 1670 
gatcccatac ttttgaactt catacagcag aatttcaaac aatcttggga aaataacaac 1730 
ttttatctgc ccagtaagga caactaacac ctagtatcat aatcatttcg taagagacag 1790 
gtaatttcat caccgagtgc atat 1814



<210> 18 

<211> 19217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 

<222> (6412) . . (6543) 

<223> exon 4 

<220> 

<221> exon 

<222> (11953) . . (12003) 

<223> exon 5 

<220> 

<221> exon 

<222> (18746) . . (18751) 

<223> coding portion of exon 6 

<220> 

<221> unsure 

<222> (722) . . (1277) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1740) . . (1924) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (3585) . . (3783) 

<223> "N" can be A, C, T, or G 
<220>

<221> unsure 

<222> (3929) . . (4224) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (5233) . . (5352) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (5974) . . (6030) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (6048) . . (6073) 

<223> "N" can be A, C, T, or G 



<220> 

<221> unsure 

<222> (6091) . . (6306) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (7576) . . (7688) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (8259) . . (8368) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (9787) . . (9885) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (9926) . . (10110) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (10113) . . (10241) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (11000) . . (11438) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 
<222> (12528) . . (12844)

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (13720) . . (13963) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (13983) . . (14275) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (14529) . . (14551) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (14585) . . (14621) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (14703) . . (14749) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (16380) . . (16436) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (16484) . . (16517) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (17022) . . (17366) 

<223> "N" can be A, C, T, or G 

ctactgagaa gggatatact ctcagactaa aggacagtcc ctagtactga ttcaatctgg 60 
ctttatagaa aattcactat attgtcattg tatttcacag tttgcccttt gtcttagctg 120 
gtaagacaga gcctatgata aggacttgtg tggcatgcag gtatttaatt ggcaacccca 180 
gagggcagaa gcaagagatt taggagttta agagagggta atataagagt atattatcaa 240 
agttgtagtg tggacaacag aaactcaaat attcaaggac cagcatgtag acagcctcct 300 
aagatgtcta ctcagacaaa gaatttcagg tggaaggact tgttcatctg cttcacgccc 360 
attggttgac aggaatatga actccattct gctgctgggc tagacatgca tgtgggctga 420 
gtgagctttc cccagtatcc gtagcatcag aaaagtcgca gggcagaaag aaaagtatcc 480
aatttgaggt gaattactga ccttgaagtg agtgtaagcc taactagaat tctaccccag 540
ctggctgaag tgaaaggtga ggctgagagg aaataaggca ggactgcaca gtccccaatt 600
gtactgttca aatccactca tgcccttcat taagtcagct ctgccactga gccttccagc 660
tgggaggcag ccacaatctc tgcagaagat ttaatataca ccagtttgtg gaacaagctg 720
tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260
nnnnnnnnnn nnnnnnnacc tgcagcactg ggatagccct ggtacagacg tagaacatgc 1320
ttgaggaggt tgtcaggagg aaatgagtta gaccttgcac agaactacca ccatcaagta 1380
cagttggggt gaggcagaca tggtatgagt tgaggcatca gagatatctg atgctttatg 1440
ccaaattaaa attaattttt tcatggagtg acactgatcc acagaccaga ctccaagaac 1500
tttgcagtga ctaaataccc atctcatcat aactttcctg gtattttctt ctggaaaaaa 1560
ttcttccctg atacagtttt cagaggcagc tagatgcact gtcatctctc cccttttccc 1620
acttccctac ctatccacaa tttactaccc aatgccaaca ctaaagttag cccaacttcc 1680
ttctaactaa attattagtt tagaaggaaa gagaggagtc atgctaagga tcttaactgn 1740
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1920
nnnnggggag aggaggggag aggaggtagg gaggggaaag aaagaaacta gaaatccatc 1980
aattttagga ccaacttcag gtaaaaaaat gaattaggca agttggtctt tcaacattct 2040
ctacctctct ttatatcatg gttgagacca cagacttctc acctcatgaa agatgaactc 2100
taactaattc atactaaagc taaagcctct aaagaggatt aaatatgagc aatcccacga 2160
gaactttttt cccctggaat tgtttattca actgtcgttc gttatatgga atttcctgcc 2220
tggttaagtg taggccagta ctttggatga attgtagttt tctagaaaga cgcttcttat 2280
ataagaacct ctccagggaa acaggggcct gtatgagatg aattgagaaa taactttaca 2340
ccactgatta tgtcagtgtt ctattctgca tggtagagat gtgaaagggc agactgacca 2400
ttgctctgga agcctttacg ctgtgagaag ttaacagtgg agtaaaatgg ccactccact 2460
ctcttcatgg aagccaacat ggcttactaa atagtcaaca accatgggag agacctgtgg 2520
ggtcttcatc agagctcagg atctcctagg gtatcactca taaatacagc catcagggag 2580
atggagaaat ctttgtgcag ccagaaattc tcaacctggt tttacccatc cttcccaact 2640
ttgtattcgt cctactgttt actgacatgg atcctctgct tcattaacca tcccttcctc 2700
accacatgct ctctgaactt ggctgcacct tttctacctc catgccttct ttgctcaggt 2760
ttttccacat aaatatcatt atttccctct ctactagctc caagcccacc ctctctctgg 2820
ggcagctcag tcactccagg gcacaagggg gtctttccct catcccacat tttgagacct 2880
actacctgga ccatttgttt gccttgtaac tatgcttgcc tttttaattg ctattttatt 2940
ttccatgtat tttcattgtt cacacaagtc ttctttattc cacactaagg caaaagcaga 3000
gtcctgtgtt cataataagt gctcaacaaa tgttgggttg attgggttgg agattccatc 3060
ttagataatc gcagtcccat catgccagct accagactgt gtggacagcc aggtcagagc 3120
agccaaatga tattctagct tgtggcacaa ataccagcaa caaaataacc aaagtcacac 3180
atctgcctct gagttcctgg cttctatttc tcaagggcat ttttaagttg tcttatgact 3240
gttccctttc tactcattct cataaattga gctgtggact gctgtgaccc acaagcttct 3300
ccggaagtca atgtataaaa caaacacgga aacgaagagt atggtgggtg gagggtactc 3360
cactgactct agaatggatg actgaacatt ccaaatttca agcacaagtt agggagcaac 3420
agatcatttt ccttttgaaa tagggtttct tctgctcagc cagttgttgt attttcatta 3480
ggaaatggaa tgggactaca gcacaaaaaa taaatataaa aggacccttg tagggctggc 3540
agaaaagaga atccttccta ggagacctgg aggtgattcc aggcnnnnnn nnnnnnnnnn 3600
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3660
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3720
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780
nnntcagaaa gtgtgcaaac agtaaaaaaa aatggtatat ctagcaagtt gcatgcctta 3840
cttgtgagtt catgaagttg tggcaaggat aagacaaata ttttttgcca ttgcatcatt 3900
atatcattgc taagagtatg ccattattnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3960 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4020 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4080 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4140 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4200 
nnnnnnnnnn nnnnnnnnnn nnnnttaatt aattaaatta aagaaacgac aaaagagtat 4260 
gcaagaattt taaaacaact tagaggaata tgtatgagga tacaggctaa gctaccataa 4320 
tgaagagacc tcgaaataca gtgagaagcg agacagaagt atctttcgtt ccatgtaaca 4380 
ctcaggtggt tcagagcagc taagcagcta tgttccatag agtcattcag tgatccagat 4440 
tattttcatc tgttgctctg ccattctcca ggatgttgtc cctataaaat tgtcaaagct 4500 
cagtcagtgc caaacccatg tttcaacctt cagaaagtaa acgagtggtg gaaaacacat 4560
tcaatgtttt aaggccaaga ccttgaaaac tcactctctt agcctgaact tagattacat 4620
ggctgggccc acttaactat aggggaggct tggaaacata gtctctgaga agccatgtgt 4680
ccagctaatt ccctaatact aaagttgaaa gaaagaatgg attaaccagc agtataccac 4740 
aaggtaacaa atgactagga ggatcaggct aggtggacta gaaaagagac agtcaattca 4800 
gtgcaacaat tccatattga cacttttcat gtagctgttg cttggctcta tctagagagg 4860 
actcagaggt agtttagata aggcctttgc cctccaaata cagtctaagc agactgattt 4920 
cctactggat gttcaacttt ggagtcttca gggatgagta gggcttctgt acgtggaaga 4980 
gactatgagg gaacctgcac aggacaaggg tttgcataaa gacactgagg tagggacctc 5040 
tcctgttgtg gggacagtga gaggcccagg tctccttgac tcacaaagtg cttactaagc 5100 
acttactaga aattaagaag cagattataa tcaatatggg ttatccaatg tttggatgag 5160 
caaggctcct tatcttttct tcgttaatgt taatcacact cttttggatg gagacaaata 5220 
tctgtggggg ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5280 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5340
nnnnnnnnnn nnggctcaga ctaaataatg tctaatctct tctccagtaa aacaatccgt 5400 
ggttctcaga tagcactgtg ctggaggtag tggggtttga gggctgggaa gttgggagga 5460
ctgagccctt cccgctgagc agtttcgtcc agtttttcct gtaccagcct gtcatgttta 5520 
ttccatgtga atgactccag aggcaaaatt caagcttttg aatagggcac aaattaactt 5580
gagtaccctt tcatttccct gtaggtgaac actcctctag ccctgccttt tgtcagtctg 5640 
gagcccttgt tctaatctgt acacaccaga ggactttaca aggctttccc cagcctccag 5700 
aattattctt ctgatccacc ctctactaaa ctcacccttt cctcagtgct aggacgttga 5760 
aaaaccgaaa caaggcaaag ggccaattgt aataattcac actaaggcat gagtgactag 5820 
gtttagtata ttaacactac ctaggatatt ctatttcttc caaaaggatc ctgttaatcc 5880 
ttgaaattta acaactaatg gtatagattc taagcactgt gagtacttgt cagtggggga 5940 
aagacatttt tgggctgaga gactttgcca ctgnnnnnnn nnnnnnnnnn nnnnnnnnnn 6000 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aagaaagatg attagatnnn nnnnnnnnnn 6060 
nnnnnnnnnn nnnaaaaaat aacatgagag nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6120 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6180 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6240 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6300 
nnnnnncaca gagctagccg tgttggctgt cacccactca tgtggccagc ctgttggtct 6360 
acctcttagt tgccatgtaa caggattctg gtgcttttcc tttgcccagg t cag atg 6417 
gaa ccc agg acc cat cca act tgg ctg ctt cac att ttc atc ccc tcc 6465
tgc atc att gct ttc att ttc ata gcc aca gtg ata gcc cta aga aaa 6513
caa ctc tgt caa aag ctg tat tct tca aaa ggtaagtgag ttttattcat 6563
ggtaacccaa tgcactgggt gtctgcagca tgagccactg ctttgcactg caggcctatg 6623
gcttgctgct ttcatgctaa acccactcag agcttatgaa ccactttgag cttgtcttga 6683
tgattatttt tccccagaag aaaatggctc tcatcgtcag tgagctgaac ttcttacact 6743
gagtttttta aagggaatgt tttgttctta tgtctgaaag agtttgtctt attctttgag 6803
ccaagagctt tcatcagcct catgagagtg atgttatttt ggcaatgcag agagctacgt 6863
gctccgattt tgctggtggg aggttgccag gatcctttct gaggattcct tccattttca 6923
cccctctttt ccccagtctg gatatgacct gggttaaacc caccccctct cccaggaatc 6983
tcaacctcac ggttgggtaa ggaaaggaga aaggtttgtg aggccatttg gggataagga 7043
aacagctggt tggtggtgca ttaacgtctt tcagcagctc ccttcgagtt tctccttagc 7103
ctgttgtatt cttaccaaca cactcctgtt ctgttgtacc agctgggaca gagcatgctg 7163
aagcctttca gccctgattt cattgcttca ttgttcatgt gtctgtcttt ggtttcctgg 7223
gtggagcctg cccacaaaac ccccagaatg tatgcaggcc tagctggtgc tttcctaaac 7283
ggctcccttg tctgcactca atgaacttct ccaaagatct atacatggcc tcatctatag 7343 
aaagagaaat gacatgtgga aataattcag taggagtttg cagcagcact atctgaggac 7403 
taggggaatt ttaagtggtt gttatcttac atttatactc ataacttcta tattttcatc 7463 
tgccataaaa tattgtcatg ttctatttgt ccattgccct atgtgtgtat gtattcactt 7523 
gggtgctgac cacaatattt ctaactgtag aatgcaagga attgttgcca aannnnnnnn 7583 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7643 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnncttca tggtaaagga 7703 
ggaggtatgt ccaacagaac ttcgactttt aaatagaacc acttcagaga gttgtgtcag 7763 
gtgcacctca gttgtcttat cttctgccat tcttctttta cctctcacac ccatacctca 7823 
gggttcaagg cctggggcct gaggactcct taataacttc agaaatgagc agctgagtgt 7883 
tccgttccag ctgtctttgg gagaatggaa tggagtcaca ctcaaagata gagtggaaat 7943 
aaatcctctc ctcatccttc accccaatct taagagtgag tgaggatatc agtagctccg 8003 
agctgggagg taaagctcaa gttctaactg tgattaggag acctttctta caaataagaa 8063 
ttaagtgaat aaatgtgcaa acaatttctt ttatattttt aatgaaccag agagaaatca 8123 
tggttgccta tataaccctt gtctccaact cacttgcatt cagatctgct ttcttacatg 8183 
tgtctgccat gcacacaaac ttgtgtgcca tggaaaaggg ttgagaactg ctggtgatgc 8243 
agacagagct ttaaannnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8303 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8363 
nnnnngcagc aagagaagga acattttaca gcttattggc cgaacttcac tgccgctagt 8423 
gtggttcaac ttggactaca gagaaatctt cctaactggt ttccctgtat tcactcctgc 8483 
tacctccaac ttggtctgtt ctcacttttt gctataatag gcttttaaaa atcataaatc 8543 
taccatgtgt cctctgtcca gaccttctcc atggcttctt attgctcact ggatgaagtt 8603 
ccaacgagcc caggatggtt tgactcatgt ctccagcttt aactgcatca ccatcacctt 8663 
cattgtctaa agctctaacc acacaggatt ttctagtcct cagaggcatg gcagtctttc 8723 
aattccgagt tttctcatac aatattgtct cttcttaaaa tattttttct tgttgtccac 8783 
ctgagttgga gtcatctttt aaatctcagc taagcttata cttcatcaag tctttcctaa 8843 
ttctacctcc acgcaccaca cccattacat taaatcccct tattatatgt ttccatagca 8903 
cctactttct tcttttcagt atactcagca cacaatcaca tgtctaggat ctgttttaat 8963 
agcttggact accaattaaa ttgcatccct tttaattgtc cattgattcc tcaagtaccc 9023 
acatgcccat cttagcaaga agttcagtgt ctccctctta tagcatgtac ttctccacct 9083 
cccacaaact gccagaaagc ttacttagcc cacagggcca gtgctaggca gctaggttag 9143 
tcctccagag ggccctggtt ttgagcagtt gctgtctact ccggccatgc agaatctctg 9203 
gtccttccag atgtctccat ccactgtgca aaggtaacct tgctggttcc gatccccaca 9263 
cagaccacag tgctacaaga ttacagttct tatggttccc caacacatgc tctgtcattg 9323 
gtcccaaagc aggaccccta tgggttgatg aggtaggagg aggtccctgc cttagccaca 9383 
gctgcacaca gccagcctct tcccttctag gccctcatgt tgagcctggg acgccagtcc 9443 
taacttcctt ctcttcagtt cctcttaggg ccattggtat cctgaatttc ttagtccatt 9503 
gcaaagttaa gtaaagaagc agcaggcttg gtccctttcc ttccagatgg cttcttagct 9563 
cctgaacaga tttacccacc tatacctcag tgactagctc tgtgtactaa agtgtattgg 9623 
gagggcagcc attattggtc cataaaaggt cctgcttacc attttcccct aagaggaacc 9683 
attcaacagt ttggggctcg agggtgacct gctgggctct agagaagaag ctggcaactt 9743 
ctgttgcaaa ataatgttaa attctgcttc atctgcttgt cttnnnnnnn nnnnnnnnnn 9803 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9863 
nnnnnnnnnn nnnnnnnnnn nntaaacatt gaacctacta tatgcaggtg agtatgctag 9923 
atnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9983 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10043 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10103 
nnnnnnntgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10163 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10223 
nnnnnnnnnn nnnnnnnnga tttgaataca ggtctgtttg actccaaaac ttgtggccta 10283 
tttgttgcaa aagtgcttaa tacaaattgg ttcagtcaat attattatct ttgaacaatg 10343 
gaaggagaaa gtaagtttca atccaaaata attgagtgac ttatacattg acttgctgag 10403 
ccaatggcaa agtcaagtta gaatccagca gaagtcacca gctacagaat ctagatcttt 10463 
agaacatgtc ttcagatctt cagaacagtg tttcttaaac tctcttgtga aggaacagtt 10523 
atcatcatag gctggtaaca gttcacctac cagcaccagc ccatgaacca gactctaagt 10583 
ggcacagccc tagaagattg agccagaatt ttacagaggt ttaaagacca aatatgctgg 10643 
tttatggtta cctgtggccc acagagaatg gcagcactaa cctcaggcat aaatgaggta 10703 
cccactgaag ccaacattca agagcaattc ctatgggtta accattgggc tcctttcaaa 10763 
tgcaaaccct catgaaagag actacagtgc tgaatagaga cctccaaatt ccaggccaag 10823 
ctcaggatag tcatgaggga attactaaaa acctggtata tagggcaaaa gcagaattag 10883 
gaatggactg atttcaggaa cccaggcaat ggcaggagtt gggcattaaa tcctaaaaga 10943 
gaatcagagt gggagggaat atgtgaaatc agaggttaag aaaaaagtga aaacctnnnn 11003 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11063 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11123 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11183 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11243 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11303 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11363 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11423 
nnnnnnnnnn nnnnnaaaaa attaaaagaa agatgtgaaa tcaaggaaac ttactggtga 11483 
gcagcatccc attatgtgaa cttgtgcttc tgaaccagta acttgagtta ctttgagcca 11543 
gtatcagtca cttatacctt agtgcaaaat taattgatca gacattctga cctggaccag 11603 
ggaaggcagg cagaagtagc agtcaagact aaagcagaaa agggagagct aattctgcag 11663 
ccagacattt cctggattga atacccaaat tagtccctca gcctttaagt gcctgagggc 11723 
caggagtaga cagaggaatg gaaagtgtga gacttctttg ttcacactct ttgcctaggg 11783 
gccagatttt gctttatgca ttaccatccg aagtcccagg ccacagtgaa catttgggct 11843 
tcgctatgtg gatttattta gatttacttt ttgtcctgcc atattttaat ctataagcca 11903 
aacagttttc tcattaatct tattccattt ctggaatttt tccttttca gac aca aca 11961 
aaa aga cct gtc acc aca aca aag agg gaa gtg aac agt gct 12003 
gtgagtaagc atgattttta cttttctttc ttactttctt ttctctctca gcttgaattt 12063 
taaagtaacc actgttctat taattcatgg aaggcaactg aatagttcca gcttatagaa 12123 
tcttcctgtt tggtagcatt tcagcgaagc ctcgttctta gccccagaac aatcatgcca 12183 
tcttttgctc ggtctatatt cctaagcact cctagatgat actgcactgg acctctggtc 12243 
tcacatagtt agaaacagag ttaaaatcga acagcaaaga gaagatattc aactgcgatg 12303 
caattgacaa tggatgtttt tgcaacaaac aatgattaag aagtacattg ttgtgggctc 12363 
tgagtcaaga gtaatatggg aaaaacacaa gtctcttcat gaggttgaca ggtttggagc 12423 
tggaatctgt ggaggaggaa ggatatgatc taggggtcag aagaagtggg ttactaaaat 12483 
cattaagcct ggttggatga aaagcttaga ctcaggggaa gcagnnnnnn nnnnnnnnnn 12543 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12603 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12663 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12723 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12783 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12843 
nggcaccaag agatgagggg ggcagtcttg gccatatatt tggctgaagt aagtcaattt 12903 
gtcattcctg catgagcctt tataaacaga agtaagtaac caactactat ttggtcattg 12963 
gagttgtcca agaggccagg gttctgtcta atacctgttc atgcatgaac atgccaacct 13023 
agattgcatg cagactacca gttttgggtt tttgtttagt tcagcaggat ttttctcagc 13083 
tcactgcctc tcaaactttc agcaacaaaa ggacatctgt gatatcagaa tctaccactc 13143 
taagtatttg gatgcaatag caatgaatat ctgagtaaat ctaggtgggg agtgggggca 13203 
ccctgtagcc aaaatgattt aacaaaatca aaccaaaatt ttggaaatga tgccttggta 13263 
caatgaagag actacttgag gtaggtttga cttatctaat atcttatttt ctttaccaat 13323 
acctaatgag gaatttaaat atttctagat agctttggaa aggtccctta aagaggcacc 13383 
agcataccac tgccagatct aatcccccca aacactgttt tcatcatcat catgtcatct 13443 
cttgtctcta tagatcatat caaatccttc ccagagtttt tcaggccttt tgacaactag 13503 
ccacatttca ctaagccaac tcatctacca ctcttcaaca aaacttttcc tcaagttgag 13563 
ctgctccacc aacaccactg ccatgagctc attcccactt ctgtggcttt gctcatgttg 13623 
gttatttttt tggagtgtcc tccctattcc ttcttacttg tcccaatccc aacttttggc 13683 
atggtctact ttaagataca gtaatgagta actttannnn nnnnnnnnnn nnnnnnnnnn 13743 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13803 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13863 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13923 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tttttaaaat ttaattcaan 13983 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14043 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14103 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14163 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14223 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntacataca 14283 
caacactctc ttgcttaatc tgtaagactc tctcccccac tcataccttt ttatttttcc 14343 
tctgcattgt acacacaatc tataccactc ttaagcacat gattacagcg ttattttctg 14403 
gctgcttcta tgtgtctata ttttaggtcc acctggtcaa tataataaag tgggatatta 14463 
gtgttaatgc aactatatgg tatttgatat ttgtctttct gtccgtttat caatgtttct 14523 
tatagnnnnn nnnnnnnnnn nnnnnnnngg tgtctgattt tgaccaaatt tgactaaata 14583 
cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngt tgagtctatc aactcaaaaa 14643 
agaataacct acaacaataa caagtttcag aacatttttt aaattactga ttttatgagn 14703 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngaat ccattgccta 14763 
gaaattgcca tggttaagat tttaatattg ctcaggccca gacagctcag ggctttgaca 14823 
ttcccacacc cattctctgc catcccagtt ctatctcatc ccaaaaccat ccattatgag 14883 
gagagtgtac agctctaggc tgcccgggag ccatcccgca ctctcatttt gtgactcggc 14943 
atcttgggag atggagtctt gggacttagc ctggacatgt cccttgcatg tacttcttac 15003 
aagactttta ttcagatgaa tattttccct tccaacttaa gaagcacagg gcttgctggt 15063 
tttgcttcac taaccagcaa ctgaagcaag acctgacttg tgaaaatgcc taatagagtt 15123 
cagtattagc gctgtttcac catttcctgg atctcttgcc tttgtgcaca tgatagaatt 15183 
gcacttctct gtgattaatt tgtagttaag tgtggtcatg taactcgctt tggtcaatta 15243 
aatgtaagca taagtgatgc gtgttatttc tgggtagaag atgtaagagt tggcatatgc 15303 
tttgccatat tttctttatc catctggcat ggtaaccagt aacattctag gtagtaattg 15363 
ctccatcagt ctcagtctct gagtgactaa aattgacaga gtcccctgct gaccctcaat 15423 
gtacatggaa catgaacaag aataagcttt tgttttttat attgagattt tggagttgtt 15483 
tgttcctaca gcattaccta gtttactcta atacaacatg gaaaaaactg gaacctataa 15543 
taaatagacc ctacgttgcc atttaaactt ctagttctga ggaataataa tgtggggaaa 15603 
tactttctat ataataaaaa aatagaaaat tgcaaaataa aaatatactt atgtatcatt 15663 
catgtcctat taaaaatgtt atttatagac tcaccatatt cccttcctcc agaaaaatag 15723 
aagtaaaaat atgaaaatgc ctgtaatcat gtttttggat tatggaatca agtattgctt 15783 
tttactttta tgttttctga atttttgttg tacttcacta catttttgag tgccctgatg 15843 
tattactttc aaaaagaaga agaatacttt ctgaagccat ttcaaccatc cccactcacc 15903 
tctctagatc ccagtaacca aatacattat ataggactct tcatcagtcc ttatcaagtt 15963 
taggaagggc gatgctatac cttctttaaa ggacacctac caatgtctta gttgcctttc 16023 
aaagactcct agcacagcta aatgtgatgg atatgctcta aggatataag agctgaagtg 16083 
acttgcataa ggtcatatca taacttactg ttagaaatgg agctagaact cagacccact 16143 
gagtccttgt ctgtgacaca ctgccctttc catttgtgga agttgttctt gtatctaact 16203 
ttatctgtgc tactatttgg gcctagccat tctccctctt atgcagacaa gcagataaac 16263 
agtaaaactt taggagtgga ttatgatacc atagatatat atcatctatc ctttacaaaa 16323 
tagttattac agtcatcaag ccttggttag agtttacaga ccatgtatcc tagctannnn 16383 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntgaaaca 16443 
gtagcagacc aaaagaagtc atgattccca gcatagtgct nnnnnnnnnn nnnnnnnnnn 16503 
nnnnnnnnnn nnnnaaacct gtattaagtt tctgttattt gcaagaccct gtcagttagg 16563 
ccatgtggga actagaagga tgaatttatc agtcatccaa gattcttaca attaagtatt 16623 
accgataagg tactcaagaa acagttctca ttcacataat ttgggttaaa acaaaaagaa 16683 
gccagctttc tatatacttt tggtccagtc tttacgtttt ttgttttgtt ttgtttgttt 16743 
tcatgagtat cccgacttcc ttctaagaac ttccacctga gaactgacca cagcgtcagc 16803 
attccacatg ggtgtgtttc ctttcccctt tcccatttca gtggtttcca atttcttttt 16863 
cttttggcac tataaacctt tcgcaaagga aatattagac agaactccta catgtcaagc 16923 
aaattaaaat agtggtgaaa ttagagtgga ggacataatc accctatcat ataggctatt 16983 
tgtccatatc atatttgtcc ctacaaaggc ctctaaggnn nnnnnnnnnn nnnnnnnnnn 17043 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17103 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17163 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17223 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17283 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17343 
nnnnnnnnnn nnnnnnnnnn nnnctctaag ataccttgtg cttcttggaa catatttgga 17403 
aaatcatgta gctctcaaat tatccctatg tcctgaagcc caccttacca tcaatcctca 17463 
gaaataccca acccatgtca ggcaacttca cactttcttt cttcaggcag cacagttgtc 17523 
tcagggaggg aggagagtgc tattagcaag aggagtcact aacagcttca ctcacctgtg 17583 
cccatgaatt tttaatggtg tgaaaagtct gtgtattttt tatggttctc tatggctcat 17643 
ataagaggga caaacaatac atgaaagttc agagatgggg acaaatatca ctttaagctg 17703 
gggataatca gggaagaaga aagttttcat ggagaaggtg acttttgaat caggaagaaa 17763 
tggaaacagc aaactcttct agatagagga gacactaaca ggaaaggcag agaggcagga 17823 
aggtgtggga aagtgcacgt gaccccgttc agagagaaag ccaggtgtga gataaggggg 17883 
aaagactgtt agggcatgta ttgtaaacca ctaattccag gcaaaagtta gattttactt 17943 
actaagcaag agtgcttcag ttagatccta gcaggaaatg gagggtatgc ttagaagagg 18003 
taactgaggc aagtttaatt tataaaggtg tgtgcagcat taagggaaac cagcaaggga 18063 
tactgagcat gccaggatgc aagagcaggc agggaaggtg actattccta ggtctgaagg 18123 
agaaagggga gggagcagtt cccagaaccc tagtaaaaat ggcaatgaga aaggtccatc 18183 
tggcaggacc tatggtcttt aacagaggga caaagtcaac ccacaacttg tctgggaggt 18243 
tgctgaggaa tagatacccc aacctctctc tcaacccact gcaacactct ttttccccta 18303 
gactgagccc agtcaaagac agagggagga gcccagtgat gcagtctgca atgtcatcat 18363 
cctggagcat gaatagagtg cagcagggtg aataatgagt ctgcaggaat taatagaaat 18423 
atctgacaca atagggaact ataagaggtt ttgaatagga gaggcccctg aaatgtgctc 18483 
caatattact gaactatgtg tggcccaaag aatggaagag gaacagctct tgcaataggt 18543 
ctgaggagag aagctgaaga cttggactag ggcaatggta aaaactgtgg aaagaagttt 18603 
taaatgaaaa gttttaaacc atgcggcttc cagctagatg aactttttta aaaaaattag 18663 
ttcctcactc aaattttggg gaggttatat attttctaat cataaaaaat gatttttctt 18723 
atttgtgggc ttttctcccc ag atc tga acctgtggtc ttgggagcca gggtgacctg 18781 
atatgacatc taaagaagct tctggactct gaacaagaat tcggtggcct gcagagcttg 18841 
ccatttgcac ttttcaaatg cctttggatg acccagcact ttaatctgaa acctgcaaca 18901 
agactagcca acacctggcc atgaaacttg ccccttcact gatctggact cacctctgga 18961 
gcctatggct ttaagcaagc actactgcac tttacagaat taccccactg gatcctggac 19021 
ccacagaatt ccttcaggat ccttcttgct gccagactga aagcaaaagg aattatttcc 19081 
cctcaagttt tctaagtgat ttccaaaagc agaggtgtgt ggaaatttcc agtaacagaa 19141 
acagatgggt tgccaataga gttatttttt atctatagct tcctctgggt actagaagag 19201 
gctattgaga ctatga 19217 

<210> 19 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Met Ile Phe Leu Leu Leu Met Leu Ser Leu Glu Leu Gln Leu His Gln 
1 5 10 15 


<210> 20 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Ala Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His 
1 5 10 15 

Gly Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val 
20 25 30 

Asn Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr 
35 40 45 

Ser Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu 
50 55 60 

Gly Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp Glu Gly 
65 70 75 80 

Gln Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr 
85 90 95 

Leu Thr Leu Lys Val Lys 
100 



<210> 21 
<211> 91 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu Lys Val Pro Glu Thr 
1 5 10 15 

Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly Tyr Pro Leu Ala Glu 
20 25 30 

Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn Thr Ser His Ser Arg 
35 40 45 

Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val Leu Arg Leu Lys Pro 
50 55 60 

Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp Asn Thr His Val Arg 
65 70 75 80 

Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser 
85 90 



<210> 22 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile 
1 5 10 15 

Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu 
20 25 30 

Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys 
35 40 



<210> 23 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Asp Thr Thr Lys Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser 
1 5 10 15 



<210> 24 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 24 

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1 5 10 



<210> 25 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Internalizing 
domain derived from HIV tat protein 

<400> 25 

Gly Gly Gly Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1 5 10 15 



<210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2515-27 

<400> 26 

cataatagag catggcagca atgtgac 



<210> 27 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2524-63 

<400> 27 

gggtcctgga gtggctggtg ttg 



<210> 28 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu 
1 5 10 



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2515-24 

<400> 29 

gtggctcttt cacggtgtgg ggatg 



<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2538-68 

<400> 30 

ccagtgtcaa agttgcattc cagggt 